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Abstract. We study the shape of the probability mass function of the Markov bi- 
nomial distribution, and give necessary and sufficient conditions for the probability 
mass function to be unimodal, bimodal or trimodal. These are useful to analyze the 
double-peaking results from a PDE reactive transport model from the engineering lit- 
erature. Moreover, we give a closed form expression for the variance of the Markov 
binomial distribution, and expressions for the mean and the variance conditioned on 
the state at time n. 
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1. Introduction 

The Markov binomial distribution occurs in diverse applications. Examples are weather 
forecasting, stock market trends, DNA matching, quality control (cf. [13]), and biometrics 
(cf. [3], see also [4]). In 1924 Markov IT showed that under certain conditions a Markov 
binomial distribution is asymptotically normally distributed. Later in 1953 Dobrusin 
[6] studied some other limit distributions of a Markov binomial distribution. In 1960 
Edwards rediscovered the Markov binomial distribution in connection with work on 
the human sex ratio. More recently many authors studied its distribution and moments 
(cf. [HIISIIII]) and its approximations by compound Poisson distributions and binomial 
distributions (cf. [TJ[2[n])- 

Our interest in the possible lack of unimodality of the Markov binomial distribution 
arose from the paper 12 where the authors deduce from simulations a somewhat sur- 
prising behaviour of double peaking in the concentration of the aqueous part of a solute 
undergoing kinetic adsorption and moving by advection and dispersion. In our paper [S] 
we will explain this behaviour rigorously from the multimodality properties that we derive 
in the present paper. 

Let {Yk,k > 1} be a Markov chain on the two states {S,F} with initial distribution 
v = (i'Sj^f) and transition matrix 



(1) P = 



P(S,S) P(S,F) 
P(F,S) P(F,F) 



1 — a a 
b 1-b 



where we assume < a,b < 1 throughout the paper. The Markov binomial distribution 
(MBD) is defined for n > 1 as the distribution of the random variable which counts the 
number of successes in n experiments with the two outcomes success and failure: 



fc=i 
1 
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We say Kn is a Bin{n, a, b, v) distributed random variable. Clearly the MBD generalizes 
the binomial distribution, where a + b — 1 and (:^Sj i^f) = a)- 

In Section [5] we will give an explicit formula for the variance of a MBD. This was not 
given in ,14^ , and only implicitly in [9l [13] . By introducing the notion of 'excentricity' we 
can write down tractable formulas for the expectation and the variance of a MBD. For 
the application to the reactive transport model we need a bit more, namely the variances 
conditioned on the state of the chain at time n. Expressions for these formulas will be 
computed in Section [31 

In Section [31 we will give a closed formula for the probability mass function /„ of Kn, 
and we study its shape. The probability mass function /„ was implicitly given in [8l[9lll4). 
but the closed formula presented here is helpful to study its shape. Surprisingly, the shape 
can be unimodal, bimodal and trimodal. We show in particular that when a + 6 > 1 the 
probability mass function of Kn is unimodal, and that the probability mass function of 
Kn restricted to the interval [1, n — 1] is always unimodal. 

In Section [5] we give formulas for the probability mass functions of Kn , conditional 
on the state at time n. Here again our interest arises from the fact that in the reactive 
transport model of [12] the authors consider the behaviour of the concentration of the 
aqueous part of a solute, which corresponds to conditioning at the state of the chain at 
time n (aqueous success, adsorbed ~ failure). 

2. The variance of the Markov binomial distribution 

Let (tts, TTp) be the stationary distribution of the chain {Yfe, fc > 1}. We have 

b a 
a + a + 

In fact, diagonalizing P yields for n = 0, 1, 2 . . . 



(2) P" 



TTg TTp 
TTg TTp 



+ 7" 



TTp —TTp 
-TTs TTS 



where 7=1 — a — 6 is the second largest eigenvalue of P. Note that for 1 < A: < n, 



{Yk = S) = i.sP'-'(S, S) + i^FP'^-i(F, S) = 7rs(l - (1 - ^)7'=- 

TTS 

and similarly. 



P. [Yk = F) = i.sP'-'(S,F) + i.FP'=-i(F,F) = 7rF(l - (1 - —h""-^). 

TTp 

It appears thus useful to define the excentricities es and ep of an initial distribution v by 

:= = 1 - — , for re{S,F}. 

Both quantities measure the deviation of the initial distribution v from the stationary 
distribution tt. Using them we can rewrite P^, (Yfc = S) and P^ (Yj. = F) as 

(3) P,(rfe = S)=^s(l-es7''"'), P,.(rfe = F) = 7rF(l-£F7^-i). 

Moreover, the expectation of Kn is given by (note that 7 < 1 since a + b > 0) 



7 



(4) E,[X„] ^Y.^- [l{n-=s}] = I] p. (Yk = S) = TTS ^es- 

fe=i fe=i 
The expectation of Kn is particularly simple if we start in the equilibrium distribution 
since in this case es ~ 0. 

Obtaining yar,y{Kn) is more involved, because of correlations. 
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Proposition 2.1. For any Bin{n,a,b,v) distributed random variable Kn, we have 



■xr (i^ \ I 7rF(l+7) 7(£s(7rs - ttf) - 27rF) - es(7i"F - i^s) , „ 2£s(7rF - tts) 
Vari,(A„) = TTS < n — 1 — h 



1-7 



(1-7F 



1-7 



Proo/. Since Var,y(i4r„) = - (Ei,[Kn])^, using (g]) it suffices to calculate 



71 

k=l 



■{n=s} 



l<i<j<n 



k=l 



= E,[i^„] + 2 p. (5^. - s, - S) . 

l<z<j<n 

Thus we only need to calculate 

(y, = s, = s) = p,(r, = s [ i;; = s)p, (f, = s) = (tts + ^F7'"')7rs(i - £s7^-') 

= 7rs(7rs + 7rF7^"' - £s t^si'^^ - £s t^fi'^^), 
using ^ and ([3]). Performing the four summations we obtain that 

l<i<j<n 



. n(n — 1) / n 

27rs <( TTS h 7rF7 



/n 1 — 7"\ /n 1— 7"\ 

VT^ " (1 -7)2j ~ "^^""^ VT^ " (1-7)2; 



esTTF 



(t 



nr , 7(1-7") 

-7 (1-7)2 



= TTS <^ n(n - 1 TTS + 2n h 2n7 1-2(1-7 ) ^ 

I 1-7 1-7 (1-7) 

which, combined with completes the proof of the proposition. 



□ 



3. The conditional variance of the Markov binomial distribution 

Here we are interested in the variance of Kn given the state of the chain at time n. 
Let be the random variable Kn conditioned on K„ = r G {S,F}. For completeness, 
we will first give the corresponding means E^[K^] and E^if^j ^^^^^ 

were also given in 

[Sllilll]. Using © and we obtain that 



E, [K^] = E,[Kn I Yn = S]=Y, MYk - S I y„ - S) 



\ P,{Yn = S I Ffe = S)P, {Yk = S) _ ELi ^""'(S, S)P, {Yk - S) 



(5) 



E 

k=l 



P. {Yn = S) 



7i's(l - £S7" ^) 



ELi(^S + 7rF7"~')7rs(l - £S7'"') 
7i's(l - es7"~^) 
TTS - £s7rF 7"~^ (ttf - £s7rs)(l - 7") 
l-es7"-i (l-7)(l-es7"-i)' 
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and similarly, 

^' ^ "J 1-£f7"~^ (1 -7)(1 -eF7"-i) 

Proposition 3.1. The variances of K^, a Bin{n,a,b,i>) distributed random variable Kn 
conditioned on Yn — t G {S,F} , are given by 

7rF^s(l + 3£s7"'') ^ £s7ri+^g7"-27rF7rs(l + £s7"'') 
1-£S7""^ (1 -7)(1 -£S7"~^) 



+ (l-7")( 
and 



7rF7rs(4 + £s) - (ttf + £s7r^) ^ gs + Trg - 27rF 7rs(l + £s) 
(1 - 7)(1 - £s7"-i) (1 - 7)^(1 - es7"-^) 



^ 1-£f7"-i V 1-£f7"-i (1 -7)(1 -£f7"~i) 

7rF7rs(l + (2 + £F)7"'') ^ + £f 7rg7" - ttf 7rs(l + £f)(1 + 7"'') 
1-£f7""1 (1 - 7)(1 - £f7""1) 

„^ / 7rF7rs(4 + £F) - (tts +£F7rg) 7r| + £f Trg, - 27rF 7rs(l + £f) \ 
^ ^ '\ (l-7)(l-£F7"-i) (1- 7)2(1 -£F7"-i) 

Proof. Since the calculation of Va.T^(^Kf^) is similar to Var^(iir^), we only deal with 
Var^(is:,^). Note that Var^^Sj = E^[(Vs)2] - {E^[K^])^. Using © it suffices to 
calculate 



L fc=l 



2 



k=l l<i<j<n 

= Eg[X„] + 2 J2 PAY^ = S,Y, =S\Yn = S). 

l<'i<j <n 

It follows from © and © that 

-p _ c V _ Q I V _ c\ _ — s) Piy(yj = s, y„ = s I = s) 
p.(y. - s, - s I y„ - sj - (r„ = s) 

^ P.(y, = S)P-'-'(S,S)P"-J(S,S) ^ (1 - £sy'^)(^s + ^F7^""')(^S + 7rF7"'^) 

p. (Yr, = S) 1 - £S7"-1 

_ 7rg7"~" - 7r|£s7'~^ 7r| - 7rg£s7"~^ 7rF7rs(7"~-^" - £s7^~^) 
~ 1 - £s7"-i 1 - £S7""^ 1 - £S7""^ 

7rF7rs(7^"^ -£S7"^^~^^"^) 
^ 1 - £S7""^ 
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Performing the eight summations in the above equation we obtain that 

l<i<j<n 

2(1 - 7"). - 2n + 



(1 - es7"-i)(l - 7)2 (1 - £S7"-^)(1 - 7) 

n(n-l)(7r|-7rg£s7"~^) 27rs7rF ( ^ } + ^^^^ q ,m 1 + gs7 \ 
1 - es7"-i 1 - es7""^ V 1 - 7 ^ ^ (1 - 7)2 / 



2™ (,7+£s7ri_(^_ „)7+£s X 
1 V 1-7 ^1 - 7 2/ 



l_egy.-iV 1-7 (1-7) 
. ,, ^i-7rg£s7"'^ , , 7rs^F(l + 7)(1 + £s7""^) - (^j^S + ^g7") 
^ ' l-es7""' (l-es7"-i)(l-7) 

,r,f._ ^s^S + 7rg7 - 7rs7rF(l + 7)(1 + gs) 
^ ^' (l-es7"-i)(l-7)' 
which, combined with ([S]), yields the expression for Var,y(if^). □ 

For the special initial distributions (0, 1) and (1, 0), we have the excentricities £3 ((0, 1)) — 
1 — eF((l, 0)). Substituting them in equations ([5]), © and Proposition l3.1l we obtain that 

Ep [Kf,] = Eg [<] , Varp (K^) = Var^ (JC^) , 

where 

Ep :=E(o,i), Eg :=E(i_o), Var^ :=Var(o,i), Var^ := Var(i_o)- 
More generally we have the following. 

Proposition 3.2. For any Bin{n, a, b, v) distributed random variable Kn and any positive 
integer m, the m"^ moment of conditioned on Yi = F is equal to the m*'^ moment of 
conditioned on Yi = S, i.e., for m = 1, 2, . . . 

Proof. Note that for m < n 

fc— 1 k—1 ii<i2 

H \-Cm X l{Fii=S,Fi2=S,...,yi„=S}i 

il<i2<---<im 

where the Ci's are constants related to n and m. This implies that for t G {S, F} 

n 

MiKr] ^r{Yk = s I r„ = r) + C2 ^ p^(r,, = s, y., = s | y„ = r) 

fe=l ii<i2 

+ • • • + c„i ^ ^ Pf (y^i = s, Yi^ = s, . . . , = s | y^ = r), 

il <i2 <---<'im 

where S = F, F = S and Pp := P(o,i)'Ps := P(i.o)- 

Thus we only need to show that ioi 1 < ii < ■ ■ ■ < ik < ri, 

(7) PF(yi =s,...,y, = s|y„ = s) = Ps(y„-^,+i = s,...,y„_,,+i = s|y„ -f). 
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It is easy to see that both sides of Equation ([7]) equal if ii = 1. Now suppose ii > 2. 
Since {Yk, fc > 1} is a homogeneous time reversible Markov chain, we have 

FviY,, = s,...,y,, = s|r„ = s) 
^ PF(l;^ - s I y,, ^ s)PF(y., ^ s I = s) ■ ■ ■ Pf^, ^ s | i-, ^ s)Pf{y^, ^ s) 

PF(r„ = s) 

_ P"-'" (S, S)P'^-"°-i (S, S) ■ ■ ■ P'^-'i (S, S)P'i-i(F, S) 
^ P"-i(F,S) 

P'^-^'-CS, S)P*'=-*'=-i (S, S) • • • (S, S)f^P*i^i(S, F) 
= fP"-HS,F) 

= Ps(^n-ife+l = S, yn-ifc_i + l = S, . . . , = S | = F), 

which yields Equation ([7]). Thus the proposition is established for m < n. In a similar 
way, one can show that the proposition holds for all m > n. □ 



4. The probability mass function of the Markov binomial distribution 

For any Bin{n,a,b,i>) distributed random variable Kn, we will give sufficient and 
necessary conditions for the probability mass function of Kn to be unimodal, bimodal or 
trimodal. These three kinds of shapes are mentioned by Viveros et al. .14, without any 
further explanation. 

Given n > 1, let /„ be the probability mass function of Kn, i.e., 

fn{j)^F,{Kn=j). 

Particularly, fn{j) = if < or j > n. By an easy computation, 

fn+2{j + 1) = P,v {Kn+1 =J + l, Yn+1 - F) P(F, F) + P, {K^+l - J, Yn+1 = F) P(F, S) 
+ P„ (Kn+l = J + 1, Yn+l = S) P(S, F) + P, {Kn+1 = j, ^n+l = S) P(S, S) 
= fn+l{j + 1)P(F, F) + P, {Kn+1 - j + 1, Yn+1 - S) (P(S, F) - P(F, F)) 

+ /„+i(.?)P(S, S) + P, (i^„+i = J, r„+i - F) (P(F, S) - P(S, S)) 
= P(F, F)/„+i(j + 1) + P(S, S)/„+i(j) + (P(S, F) - P(F, F))/„(j), 

where the last equality holds since 

P(S, F) + P(S, S) P(F, F) + P(F, S) = 1. 
Substituting ([T]) in the above recursion equation yields that for n > 1 
(8) fn+2{j + 1) = (1 - b)fn+i{j + 1) + (1 - a)fn+i{j) - {1 - a - b)Ui) 

with initial conditions 

/i(0) = ^F, /i(i) = i^s; 

^' /2(0) =i.F(l-&), /2(l) = M + i^sa, f2{2) = vs{l-a). 

In [U m [14] (implicit) expressions for the probability mass function of Kn are given, but 
the closed form presented here is more helpful to study its shape. 
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Proposition 4.1. The probability mass function fn of a Bin{n, a, b, v) distributed random 
variable Kn can be written as 



fnU) = { 







J=0, 

1 < j < n- - 1, 

j = n, 
otherwise, 



where 5 ~ abj ((1 — a)(l — fe)) and 



n — 2 — j\ vsa + vpb / n — 2 — j\ i/pab f n — 2 — j 



k-l 



1-6 



(1-5)2^ k + 1 



Proof. It is easy to see that the recursion equation ([8]) with initial conditions ^ has a 
unique solution. We only need to check that /„ presented in the proposition satisfies the 
equations ([U and ([9|), and that the summation of fn{j) from j = to n equals 1. It is 
easy to see that (jS]) holds for j < and j > n. Equation (jS]) holds for j = since 



(1 - 6)/„+i(l) + (1 - a)/„+i(0) -{I -a- 6)/„(0) 
= (1 - 6)"+ico,o(n + 1) + vf{1 - a)(l - - i^F(l -a- b){l - b^^^ 
= (1 - 6)""^((1 - b){i^sa + i^Fb) + {n- l)i^Fab + lypi^ - a)(l - b) - i^f(1 - a - b)) 
= {l-br-\{l-b){i,sa + i^Fb)+ni,Fab) = /„+2(l). 



Similarly, Equation ([8]) holds for j = n. 

Suppose now 1 < j < n — 1. From simple properties of the binomial coefficients in 
Cj^kin) it follows that 



Cj_i,fc(n) = Cj,fe(n + 1) = Cj+i^kin + 2), 



and 



(10) 



Cj,fc(ri + 2) = Cj+i^n + 2)+ Cj+i,fc_i(n + 2). 
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We write Cj^k '■— Cj,k{n + 2) for short. Thus 

(1 - + !) + (!- a)U+i{j) - (1 - a - 6)/„(j) 

j 



(i - by+^-^i - ay 



■E 

fe=o 



k=0 



E 

.k=Q 

3-1 
k 



(1 - br+'-^i - ay J2 ; + (1 - br+'-^i - ay J2 



k=0 



(l-<5)(l-6r+i-^-(l-a)^-^ 



k=0 



i-1 
k 



5^c 



■j + l.k 



J -I 



k=0 



k=0 



J-1 
k 



(1 
(1 



.k=0 

j 



fe=0 

j 



k=0 



J-1 

A: 



.k=0 

j 



A;=0 



J-1 



(1 - br+'-^i - ay J2 ijs'c.^k - /„+2(j + 1) 



k=0 

Now we are gomg to show by induction that X]j=o fn{j) = 1 for each n > 1. For n = 1 
and 2, we have /i(0) + /i(l) = i^f + >^s = ^, and 

/2(0) + /2(1) + /2(2) = i^f(1 -b) + VYb + z/ga + - a) = 1. 
Suppose /„ and fn+i are probabihty mass functions, then by Equation ([8]) 

ri+2 n+2 ri+2 n+2 

E = (1 - E + (1 - ^) E - 1) - (1 - a - E /"(^' - 1) 

i=0 J=0 j=Q 3=0 

= (1 - a) + (1 - 6) - (1 - a - 6) = 1. 
This completes the proof. □ 

Example 4.1. Let n = 200, a = 0.01,5 = 0.03 and = (0.1,0.9). By Proposition \4l\ 
we obtain the probability mass function of K2Q0 shown in Figure Ql Apparently /200 
trimodal. 

A finite sequence of real numbers {xi}f^Q is said to be unimodal if there exists an 
index < n* < n, called a mode of the sequence, such that a;o < xi < • • • < Xn* and 
Xn* > Xn*+i > • • • > 2^n- In particular, we call the sequence {xi}^^Q strictly unimodal if 
all modes n* satisfy < rt* < n. From the definition it is easy to see that a monotonic 
sequence is unimodal. 

A nonnegative sequence {xiY^^q is called log-concave (or strictly log-concave) if Xi-iXi+i < 
x^ (or Xi-iXi+i < x^) for all 1 < i < n — 1. It is well known that the sequence {xi}f^Q 
is log-concave if and only if Xi-^^iXi^^i < Xi^Xi^ for all 1 < ii < «2 < ~ 1- Moreover, 
log-concavity implies unimodality. 

The definitions of unimodality and log-concavity can be extended naturally to infinite 
sequences. 
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0.015 - 



0.010 - 



0.005 - 



0.000 




Figure 1. Probability mass function /200 of if 200 with a — 0.01, b ~ 0.03 
and ly = (0.1,0.9). 

Proposition 4.2. Let a + b > 1, and let /„ be the probability mass function of a 
Bin{n,a,b,i>) distributed random variable Kn- Then the sequence {fn{j)}^=o ^'^d' 
concave, and hence unimodal. Moreover, themoden* satisfies [Ey[ii'„]J < n* < [E,y[iir„]]. 

Proof. Let Gn be the generating function of Kn, i.e., for all real s 

n 

Without loss of generality we suppose < < 1. Then (by Proposition 14. ip G„ has 
positive coefhcients. It follows from the recursion equation ([8]) that 

G„+2(s) = ((1 - a)s + (1 - 6))G„+i(s) - (1 - a - 6)sG„(s). 

Since a + b > 1, we obtain by Corollary 2.4 of [TO] that for each n > 1 all zeros of G„ 
are real. Thus the sequence {/n(j)}"=o log-concave and hence unimodal with mode n* 
between LE,.[ii:„]J and ['E^Kn]]. □ 

When a + b < 1, Figure [1] suggests that the probability mass function {fn{j)}"=o is 
not unimodal. However, Figure [T] also suggests that {/n(j)}"=i is unimodal. We will 
indeed show in Proposition 14.31 that the sequence {fn{j)}"Zi is log-concave, implying 
unimodality. In order to prove Proposition 14.31 it is helpful to use the following lemma 
which can be derived directly from Lemma 2.2 and Proposition 2.4 of [TO]. To be more 
self-contained, we give a proof by using simple properties of binomial coefficients and 
log-concave sequences. 

Lemma 4.1. For any positive integer j and a nonnegative log-concave sequence {xk}k, 
let dj^k ■= ikj^k- Then for any < 21 < m < 2j, 

Lm/2J 

J2 DjA^) > 0, 

where for k < m/2 

Dj.ki^') = 2dj,k dj^m-k — dj-l.k rfj + l,m-/c — dj+i^k dj-i^m-k, 

and for m even and k — m/2 

Dj^kim) — dj k ^ dj-i^k dj+i^k- 
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Proof. Note that for fc < m/2 



J 

k J \m — k 



j-l\fj + l\ fj + l\fj-l 



fc— l/Vm — fc/ Vfc — 1/\TO — fc 



m — k^ 



k J \m — k 

•^k-^rn — k 



^k-^7n—k 



fc 



j\ ( J - 1 



yUi — k — 1 J \k J \m — fc — 
For brevity, we only show the lemma for m odd. Let m = 2s + 1. Then for < £ < s < j, 

3 



Y^D,,k{m)^ ^ [ 



k=e-i 



fc 



fc 



k=l 
s-1 



k=e 



k 



TO — fc — 1 
j 

m — fc — 1 
j 

I - fc - 1 



m — fc — 1 



-fc- 1 
-fc-1 



Xk+lXm-k-l 



•^k-^m — k 



{•^k+l-^m—k—l •^k-^m — k^ 



m — ij \l—\l\ra — l 



XlXm-l > 0, 



where the last inequality holds since j.^) (j^j'j..^) > (fc) for fc < s — 1 and the 

sequence {xk}k is log-concave. This completes the proof of the lemma. □ 



Inspired by the proof of Theorem 3.10 of [16], we are going to use Lemma [4. II to show 
the log-concavity of an important class of sequences. 

Lemma 4.2. Let 6 > Q and {cj^k}j,kei, be an nonnegative double sequence satisfying 
and Cj k = for all j £ Zi and fc < —2. Then the sequence 



fe=o 



is log-concave. 

Proof. We fix j > 1. Let dj^k '■— (i)^''- We have to show that Zj > Zj_iZj+i where 



J2 (^'^'^Ci^fe =H'^J.'=^J■'=■ 
fe=o 



fc=0 



We use the short notation Vk :— Cj+i^fc. Since Cj,k — Cj+i,fc + Cj+ijt_i, this yields 
Zj+i = ^dj+i^kVk, Zj = ^dj,fe(wfc -|-Ufc_i), Zj_i = ^dj_i,fe(wfc + 2vk-i +Vk-2). 

fe=0 fc=0 fc=0 
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Note that Vk = Cj+i,fc — for all j and k < —2 and dj^k = for fc < or fc > j, by the 
definition of (^.) . Rewrite 

j+2 3+2 

Zj+i = ^ Zj = ^ {dj,k-i +dj^k)vk~i, 

k=Q k=0 
3+2 

Zj-i = ^ {dj-i^k-i + 2 dj-i,fe + dj^i,k+i)vk^i- 

k=Q 

Then z|— can be rewritten in a quadratic form of j+3 variables w_i , wo, wi , . . . , Wj+i : 

2(j+2) Lm/2J 



^ ^ ej^kim)vk-iVm-k-i, 



m=0 k=0 



where 

Gj.ki'm) = 2 (dj^k-i + dj,k) (dj,m-fc-i + dj,,n-k) 

— {dj-l,k-l + 2rfj-l,fc + rfj_i^i;+i)rfj-|_i_m_fe_i 

— rfj + l,fe-l {dj-l.m-k-l + 2 dj-i^m-k + dj-i,m-k+l) ■ 

Since the w^'s are all nonnegative, it suffices to show that J2k2o^^ E-j.k{m) > 0, for all 
< TO < 2(j + 2). Rewrite 

ej^k{m) ^ Pk + 2Qk + Rk, 

where 

Pk = 2dj^k-l djjn-k-l — cJj"-l,fe-l dj"+l,m-fc-l — rfj"+l,*:-l c'j-l.m-fc-l , 

Qfc = dj^k-l dj^m-k + C?j,fc dj^„i-k-l — dj^i^k dj+ijn-k-1 ^ o'j+l.fc-l C^j-l.m-fe, 

-R/c = 2 dj,k dj,m-k — dj-i,k+l dj^i^rn-k-l ^ C?j + l,fc-l C?j-l,m-fc+l- 

Then we only need to show that 

[m/2j L»»/2J ["i/Zj 

^ Pfe > 0, ^ Qfe > 0, ^ i?fe > 0. 

fe=0 fc=0 k=0 

For brevity, we show this only for the case to is odd. For m even the proof is very similar, 
but somewhat longer. Let to = 2s + 1. It follows from Lemma BT] that 

s s s — 1 

Y,Pk = Y. Dj.k-iim ~2) = Y, Dj.,k{m - 2) > 0, 

fc=0 fc=0 k=0 

where the second equality holds since Dj^kifn — 2) = for fc < 0. Recalling from Lemma 
14. II that Dj,s{m — 1) — d^ ^ ~ rfj-i.s dj+i^s we also have 

s s — 1 s 

Qk = {d-j,k dj^m-k-1 — dj+i^k rfj-l,m-fe-l) + ^^(t^j./c dj^m~k-l — dj^l^k dj"+l,m-fe-l) 

fe=0 fe=-l fe=0 



^i?,-,(TO-l) >0. 



fc=0 
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Moreover, 

s s s+1 



k=0 k=Q k=l fe=-l 

s s 

fe=o fe=o 
This finishes the proof of the lemma. □ 

Proposition 4.3. For any Bin^Uya^bji') distributed random variable K„, let /„ be its 

probability mass function. Then the sequence {fnU)}^=i log-concave. 

Proof. According to Proposition 14. 1 1 we have that for 1 < j < n — 1 

fnij) = (1 - hr-^{i ay-' J2 P fc O'^'^.-i.'o' 

fc=0 ^ ^ 

where (5 > 0, and the double sequence {cj_k}j,kez satisfies the recursion equation 

^j,k — Cj+l,k + Cj+l,k-l 

(cf. Equation (IIOI) ). and Cj^k = for fc < —2. It follows from Lemma that the sequence 
{/nO')}"=i is log-concave. □ 

In fact we can show by sharping the proof of Lemma [4.21 that {/ri(j)}"^i is strictly 
log-concave, i.e., > fn{j — l)/nO -I- 1) for j = 2, . . . , n — 2. Proposition 14.31 implies 

that the shape of the probability mass function of Kn, which can be unimodal, bimodal 
or trimodal, is determined by the following six values: 

/„(0), /„(!), /„(2), /„(n-2), /„(n-l) and /„(n). 

Theorem 4.1. For any Bin{n, a, h, v) distributed random variable Kn, let /„ be its prob- 
ability mass function. Then fn is unimodal, except that 

fn is bimodal with one peak on the left if and only if fn{0) > /«(!) < /ri(2) and either 
fn{n - 1) > fn{n) or fn{n - 2) < - 1) < fn{n); 

fn is bimodal with one peak on the right if and only if /n(ri — 2) > fn{n — 1) < fnin) 
and either /„(0) < /„(l)or /„(0) > /„(!) > /„(2); 

/„ is trimodal if and only if /„(0) > /„(!) < /„(2) and fn{n - 2) > fn{n - I) < /,i(n). 

Example 4.2. We consider the special case v = tt = {b/{a + b),a/{a + b)) and n = 50. 
It follows from Proposition \4-l\ that 



UO) ^ (1 - ^)"-^, ^ il^f-!^(2 + in - 2)-^) , 

a-fo a-\- b \ 1 — 0/ 



{l-b)"-^ab. 

a -f 
and 

, , {\ — h)'^~'^ ab ( , f , , a \ / , ,2a , a -.2 ( n — 'S\\^ 
/42) = ^-^^{(l-a)(2 + (n-3)^)+5(l + (n-3)^ + (^) ( ^ j)}. 

In a similar way one obtains the formulas for fn{n — 2), /n(n — 1) and fn{n). Figure\^ is 
obtained via Theorem \4.1\ For some examples of probability mass functions in this class, 
see Figure\^ 
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Figure 2. (I) When (a, b) is in the gray region, /50 is strictly unimodal; 
when (a, b) is in the lower light gray region, /50 is decreasing; when (a, b) 
is in the upper light gray region, /50 is increasing; (II) When (a, b) is in 
the dark gray region, /50 is bimodal with one peak on the left; when (a, b) 
is in the black region, /50 is bimodal with one peak on the right; (III) 
When (a, b) is in the white region, /50 is trimodal. 



0.10 




10 20 30 40 50 10 20 30 40 50 



Figure 3. Probability mass function of iiTso with = tt. In the upper 
left graph a = 0.3, 6 = 0.5 and in the upper right graph a = 0.05, b = 0.2. 
In the lower left graph a = 0.09, b = 0.05, and in the lower right graph 
a = 0.02, 6 = 0.5. 



5. The conditional probability mass functions 

For any Bin{n, a, b, u) distributed random variable Kn, let be the probability 
function of K'^ with r G {S, F} , i.e., 

flij) = P. {Kl = j) = P. {Kn = j I y„ = r) . 
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In order to deal with it is simpler to deal with the partial probability mass functions 

= {K^ = J, Y^^t)^ /,I(j)P, (y„ = r) . 
Since = fn ^ fn^ only deal with f^. It is easy to obtain the recursion equation: 

/„'+2(j + 1) = (1 - + 1) + (1 - «)/,^+i(.7) - (1 - a - 6)/„'(j), 

with initial conditions 

/T(o) = o, /?(!) = i^s; 

/l(0)=0, /|(1) = J^F&, /|(2)=j.s(l-a). 
Then we obtain the following proposition in a similar way as Proposition |4?T] 



Proposition 5.1. The partial probability mass function of a Bin{n, a, b, v) distributed 
random variable Kn can be written as 

(1 - - ap-i Jn) 1 < J < n - 1, 

otherwise, 



where S = ab/ ((1 — a)(l — 6)) and 



n — 2 — A i/F& / n — 2 — j 



k-1 l-b\ k 



From Lemma it follows that the sequence {fn{j)}j'Zo log-concave, and hence 
{/n(j)}J=o^ is log-concave. Thus, in contrast to /„, can not have a trimodal shape. 
The unimodal or bimodal (with one peak on the right) shape of depends on the values 
of fnij) for J = n — 2, n — 1, n. 

Similarly, the shape of can only be unimodal or bimodal (with one peak on the left) 
depending on the values of /^(j) for j = 0, 1, 2. 
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